NSF PAR Search | NSF Public Access Repository

Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation

https://doi.org/10.5281/zenodo.15870304

Chu, Yucheng; He, Peng; Li, Hang; Han, Haoyu; Yang, Kaiqi; Xue, Yu; Li, Tingting; Krajcik, Joseph; Tang, Jiliang (July 2025, International Educational Data Mining Society)
Mills, Caitlin; Alexandron, Giora; Taibi, Davide; Lo Bosco, Giosuè; Paquette, Luc (Ed.)
Short answer assessment is a vital component of science education, allowing evaluation of students' complex three-dimensional understanding. Large language models (LLMs) that possess human-like ability in linguistic tasks are increasingly popular in assisting human graders to reduce their workload. However, LLMs' limitations in domain knowledge restrict their understanding of task-specific requirements and hinder their ability to achieve satisfactory performance. Retrieval-augmented generation (RAG) emerges as a promising solution by enabling LLMs to access relevant domain-specific knowledge during assessment. In this work, we propose an adaptive RAG framework for automated grading that dynamically retrieves and incorporates domain-specific knowledge based on the question and student answer context. Our approach combines semantic search and curated educational sources to retrieve valuable reference materials. Experimental results on a science education dataset demonstrate that our system achieves an improvement in grading accuracy compared to baseline LLM approaches. The findings suggest that RAG-enhanced grading systems can serve as reliable support with efficient performance gains.
Free, publicly-accessible full text available July 20, 2026
Aligning large language models and geometric deep models for protein representation

Shu, Dong; Duan, Bingbing; Guo, Kai; Zhou, Kaixiong; Tang, Jiliang; Du, Mengnan (May 2025, Patterns)

Free, publicly-accessible full text available May 9, 2026
A LLM-Powered Automatic Grading Framework with Human-Level Guidelines Optimization

https://doi.org/10.5281/zenodo.15870201

Chu, Yucheng; Li, Hang; Yang, Kaiqi; Shomer, Harry; Copur-Gencturk, Yasemin; Kaldaras, Leonora; Haudek, Kevin; Krajcik, Joseph; Shin, Namsoo; Liu, Hui; et al (July 2025, International Educational Data Mining Society)
Mills, Caitlin; Alexandron, Giora; Taibi, Davide; Lo Bosco, Giosuè; Paquette, Luc (Ed.)
Open-text responses provide researchers and educators with rich, nuanced insights that multiple-choice questions cannot capture. When reliably assessed, such responses have the potential to enhance teaching and learning. However, scaling and consistently capturing these nuances remain significant challenges, limiting the widespread use of open-text questions in educational research and assessments. In this paper, we introduce and evaluate GradeOpt, a unified multi-agent automatic short-answer grading (ASAG) framework that leverages large language models (LLMs) as graders for short-answer responses. More importantly, GradeOpt incorporates two additional LLM-based agents, the reflector and the refiner, into the multi-agent system. This enables GradeOpt to automatically optimize the original grading guidelines by performing self-reflection on its errors. To assess GradeOpt's effectiveness, we conducted experiments on two representative ASAG datasets, which include items designed to capture key aspects of teachers' pedagogical knowledge and students' learning progress. Our results demonstrate that GradeOpt consistently outperforms representative baselines in both grading accuracy and alignment with human evaluators across different knowledge domains. Finally, comprehensive ablation studies validate the contributions of GradeOpt's individual components, confirming their impact on overall performance.
Free, publicly-accessible full text available July 12, 2026
LPFormer: An Adaptive Graph Transformer for Link Prediction

https://doi.org/10.1145/3637528.3672025

Shomer, Harry; Ma, Yao; Mao, Haitao; Li, Juanhui; Wu, Bo; Tang, Jiliang (August 2024, ACM)

Full Text Available
Uncovering Problematic Designs Hindering Ubiquitous Cellular Emergency Services Access

https://doi.org/10.1145/3636534.3690704

Hu, Yiwen; Chen, Min-Yue; Yan, Haitian; Cheng, Chuan-Yi; Tu, Guan-Hua; Li, Chi-Yu; Xie, Tian; Peng, Chunyi; Xiao, Li; Tang, Jiliang (December 2024, ACM)

Full Text Available
Position: Graph Foundation Models are Already Here

Mao, Haitao; Chen, Zhikai; Tang, Wenzhuo; Zhao, Jianan; Ma, Yao; Zhao, Tong; Shah, Neil; Galkin, Mikhail; Tang, Jiliang (July 2024, ICML)

Graph Foundation Models (GFMs) are emerging as a significant research topic in the graph domain, aiming to develop graph models trained on extensive and diverse data to enhance their applicability across various tasks and domains. Developing GFMs presents unique challenges over traditional Graph Neural Networks (GNNs), which are typically trained from scratch for specific tasks on particular datasets. The primary challenge in constructing GFMs lies in effectively leveraging vast and diverse graph data to achieve positive transfer. Drawing inspiration from existing foundation models in the CV and NLP domains, we propose a novel perspective for GFM development by advocating for a "graph vocabulary", in which the basic transferable units underlying graphs encode the invariance on graphs. We ground the graph vocabulary construction from essential aspects including network analysis, expressiveness, and stability. Such a vocabulary perspective can potentially advance the future GFM design in line with the neural scaling laws. All relevant resources with GFM design can be found here.
Full Text Available
Demystifying Structural Disparity in Graph Neural Networks: Can One Size Fit All?

Mao, Haitao; Chen, Zhikai; Jin, Wei; Han, Haoyu; Ma, Yao; Zhao, Tong; Shah, Neil; Tang, Jiliang (December 2023, NeurIPS)

Recent studies on Graph Neural Networks (GNNs) provide both empirical and theoretical evidence supporting their effectiveness in capturing structural patterns on both homophilic and certain heterophilic graphs. Notably, most real-world homophilic and heterophilic graphs are comprised of a mixture of nodes in both homophilic and heterophilic structural patterns, exhibiting a structural disparity. However, the analysis of GNN performance with respect to nodes exhibiting different structural patterns, e.g., homophilic nodes in heterophilic graphs, remains rather limited. In the present study, we provide evidence that GNNs on node classification typically perform admirably on homophilic nodes within homophilic graphs and heterophilic nodes within heterophilic graphs while struggling on the opposite node set, exhibiting a performance disparity. We theoretically and empirically identify effects of GNNs on testing nodes exhibiting distinct structural patterns. We then propose a rigorous, non-i.i.d. PAC-Bayesian generalization bound for GNNs, revealing reasons for the performance disparity, namely the aggregated feature distance and homophily ratio difference between training and testing nodes. Furthermore, we demonstrate the practical implications of our new findings via (1) elucidating the effectiveness of deeper GNNs; and (2) revealing an overlooked distribution shift factor on the graph out-of-distribution problem and proposing a new scenario accordingly.
Full Text Available
Single-Cell Multimodal Prediction via Transformers

https://doi.org/10.1145/3583780.3615061

Tang, Wenzhuo; Wen, Hongzhi; Liu, Renming; Ding, Jiayuan; Jin, Wei; Xie, Yuying; Liu, Hui; Tang, Jiliang (October 2023, CIKM 2023)
How does the Memorization of Neural Networks Impact Adversarial Robust Models?

https://doi.org/10.1145/3580305.3599381

Xu, Han; Liu, Xiaorui; Wang, Wentao; Liu, Zitao; Jain, Anil K; Tang, Jiliang (August 2023, ACM)
Toward Degree Bias in Embedding-Based Knowledge Graph Completion

https://doi.org/10.1145/3543507.3583544

Shomer, Harry; Jin, Wei; Wang, Wentao; Tang, Jiliang (April 2023, ACM)
